A Pythagoras proof of Szemerédi's regularity lemma

Notes for our seminar — Alexander Schrijver 

Abstract. We give a short proof of Szemerédi's regularity lemma, based on elementary geometry.

The 'regularity lemma' of Endre Szemerédi [1] roughly asserts that, for each $\varepsilon > 0$, there
exists a number $k$ such that the vertex set $V$ of any graph $G = (V, E)$ can be partitioned
into at most $k$ almost equal-sized classes so that between almost any two classes, the edges are
distributed almost homogeneously. Here 'almost' depends on $\varepsilon$. The important point is that $k$
(though generally extremely large) depends only on $\varepsilon$, and not on the size of the graph. The
lemma has several applications in graph and number theory, discrete geometry, and theoretical 
computer science. 

We give a short proof based on elementary Euclidean geometry. The general line of the 
proof is like that of the standard proof (in fact, Szemerédi's original proof), but most of the
technicalities are swallowed by Pythagoras' theorem. We prove two lemmas, one on '$\varepsilon$-balanced'
partitions, the other on '$\varepsilon$-regular' partitions.

Let $V$ be a finite set. A partition of $V$ is a collection of disjoint nonempty sets (called
classes) with union $V$. Partition $Q$ of $V$ is a refinement of partition $P$ if each class of $Q$ is
contained in some class of $P$. For $\varepsilon > 0$, partition $P$ of $V$ is called $\varepsilon$-balanced if $P$ contains a
subcollection $C$ such that all sets in $C$ have the same size and such that $|V \setminus \bigcup C| \le \varepsilon |V|$.

Lemma 1. Each partition $P$ of $V$ has an $\varepsilon$-balanced refinement $Q$ with $|Q| \le (1 + \varepsilon^{-1})|P|$.

Proof. Define $t := \varepsilon |V| / |P|$. Split each class of $P$ into classes, each of size $\lceil t \rceil$, except for at
most one of size less than $t$. This gives $Q$. Then $|Q| \le |P| + |V|/t = (1 + \varepsilon^{-1})|P|$. Also, the
union of the classes of $Q$ of size less than $t$ has size at most $|P| t = \varepsilon |V|$. So $Q$ is $\varepsilon$-balanced. ∎
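
The splitting used in this proof can be sketched in Python. This is only an illustration under stated assumptions and not part of the original notes: the name `balanced_refinement`, the representation of a partition as a list of sets, and the use of a float for $\varepsilon$ are all choices made here.

```python
import math

def balanced_refinement(partition, eps):
    """Split each class into chunks of size ceil(t), t = eps * |V| / |P|.

    Hypothetical helper: `partition` is a list of disjoint nonempty sets.
    Returns the refined partition and the common chunk size ceil(t).
    """
    n = sum(len(cls) for cls in partition)      # |V|
    t = eps * n / len(partition)
    size = math.ceil(t)
    refined = []
    for cls in partition:
        members = sorted(cls)
        # Full chunks of `size` elements, plus at most one smaller leftover.
        for start in range(0, len(members), size):
            refined.append(set(members[start:start + size]))
    return refined, size

P = [set(range(0, 23)), set(range(23, 40))]    # |V| = 40, |P| = 2
Q, size = balanced_refinement(P, eps=0.25)     # t = 0.25 * 40 / 2 = 5
print(len(Q), size)                            # 9 5
```

In this example the classes of size 5 play the role of the subcollection $C$; the two leftover classes contain 3 + 2 = 5 vertices, which is at most $\varepsilon |V| = 10$.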
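
Let $G = (V, E)$ be a graph. For nonempty $I, J \subseteq V$, the density $d(I, J)$ of $(I, J)$ is the
number of adjacent pairs of vertices in $I \times J$, divided by $|I \times J|$. Call the pair $(I, J)$ $\varepsilon$-regular
if for all $X \subseteq I$, $Y \subseteq J$:

(1) if $|X| > \varepsilon |I|$ and $|Y| > \varepsilon |J|$ then $|d(X, Y) - d(I, J)| \le \varepsilon$,

and $\varepsilon$-irregular otherwise. A partition $P$ of $V$ is called $\varepsilon$-regular if

(2) $\sum_{I, J \in P,\ (I, J)\ \varepsilon\text{-irregular}} |I|\,|J| \le \varepsilon |V|^2$,

and $\varepsilon$-irregular otherwise.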
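
Definitions (1) and (2) can be checked by brute force on very small graphs. The Python sketch below is an illustration only: the function names, the adjacency representation (a dict mapping each vertex to its set of neighbours), and the float comparisons are assumptions made here, and the enumeration of all subsets is exponential in the class sizes.

```python
from itertools import combinations

def density(adj, I, J):
    """d(I, J): fraction of pairs (u, v) in I x J with u adjacent to v."""
    adjacent = sum(1 for u in I for v in J if v in adj[u])
    return adjacent / (len(I) * len(J))

def large_subsets(S, bound):
    """Yield all subsets of S with more than `bound` elements."""
    items = sorted(S)
    for k in range(1, len(items) + 1):
        if k > bound:
            yield from combinations(items, k)

def is_regular_pair(adj, I, J, eps):
    """Condition (1), tested over every admissible X and Y."""
    d = density(adj, I, J)
    return all(
        abs(density(adj, X, Y) - d) <= eps
        for X in large_subsets(I, eps * len(I))
        for Y in large_subsets(J, eps * len(J))
    )

def is_regular_partition(adj, P, eps):
    """Condition (2): irregular pairs carry weight at most eps * |V|^2."""
    n = sum(len(cls) for cls in P)
    bad = sum(len(I) * len(J) for I in P for J in P
              if not is_regular_pair(adj, I, J, eps))
    return bad <= eps * n * n

# A perfect matching between I and J: d(I, J) = 1/3, but d({0}, {3}) = 1.
matching = {0: {3}, 1: {4}, 2: {5}, 3: {0}, 4: {1}, 5: {2}}
print(is_regular_pair(matching, {0, 1, 2}, {3, 4, 5}, eps=0.3))  # False
```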

For Lemma 2 we need the following. Consider the matrix space $\mathbb{R}^{V \times V}$, with the Frobenius
norm $\|M\| = \mathrm{Tr}(M^{\mathsf T} M)^{1/2}$ for $M \in \mathbb{R}^{V \times V}$. For nonempty $I, J \subseteq V$, let $L_{I,J}$ be the
1-dimensional subspace of $\mathbb{R}^{V \times V}$ consisting of all matrices that are constant on $I \times J$ and 0
outside $I \times J$. For any $M \in \mathbb{R}^{V \times V}$, let $M_{I,J}$ be the orthogonal projection of $M$ onto $L_{I,J}$. So
the entries of $M_{I,J}$ on $I \times J$ are all equal to the average value of $M$ on $I \times J$.

If $P$ is a partition of $V$, let $L_P$ be the sum of the spaces $L_{I,J}$ with $I, J \in P$, and let $M_P$
be the orthogonal projection of $M$ onto $L_P$. So $M_P = \sum_{I, J \in P} M_{I,J}$. Note that if $Q$ is a
refinement of $P$, then $L_P \subseteq L_Q$, hence $\|M_P\| \le \|M_Q\|$.

Lemma 2. Let $\varepsilon > 0$ and $G = (V, E)$ be a graph, with adjacency matrix $A$. Then each
$\varepsilon$-irregular partition $P$ has a refinement $Q$ with $|Q| \le |P| 4^{|P|}$ and $\|A_Q\|^2 > \|A_P\|^2 + \varepsilon^5 |V|^2$.

Proof. Let $(I_1, J_1), \ldots, (I_n, J_n)$ be the $\varepsilon$-irregular pairs in $P^2$. For each $i = 1, \ldots, n$, we can
choose (by definition (1)) subsets $X_i \subseteq I_i$ and $Y_i \subseteq J_i$ with $|X_i| > \varepsilon |I_i|$, $|Y_i| > \varepsilon |J_i|$ and
$|d(X_i, Y_i) - d(I_i, J_i)| > \varepsilon$. For any fixed $K \in P$, there exists a partition $Q_K$ of $K$ such that
each $X_i$ with $I_i = K$ and each $Y_i$ with $J_i = K$ is a union of classes of $Q_K$ and such that
$|Q_K| \le 2^{2|P|} = 4^{|P|}$ (see footnote 2). Let $Q := \bigcup_{K \in P} Q_K$. Then $Q$ is a refinement of $P$ such that each
$X_i$ and each $Y_i$ is a union of classes of $Q$. Moreover, $|Q| \le |P| 4^{|P|}$.

Now note that for each $i$, since $(A_Q)_{X_i,Y_i} = A_{X_i,Y_i}$ (as $L_{X_i,Y_i} \subseteq L_Q$) and since $A_{X_i,Y_i}$ and
$A_P$ are constant on $X_i \times Y_i$, with values $d(X_i, Y_i)$ and $d(I_i, J_i)$, respectively:

(3) $\|(A_Q - A_P)_{X_i,Y_i}\|^2 = \|A_{X_i,Y_i} - (A_P)_{X_i,Y_i}\|^2 = |X_i|\,|Y_i|\,(d(X_i, Y_i) - d(I_i, J_i))^2 > \varepsilon^4 |I_i|\,|J_i|$.

Then negating (2) gives with Pythagoras, as $A_P$ is orthogonal to $A_Q - A_P$ (as $L_P \subseteq L_Q$), and
as the spaces $L_{X_i,Y_i}$ are pairwise orthogonal,

(4) $\|A_Q\|^2 - \|A_P\|^2 = \|A_Q - A_P\|^2 \ge \sum_{i=1}^{n} \|(A_Q - A_P)_{X_i,Y_i}\|^2 \ge \sum_{i=1}^{n} \varepsilon^4 |I_i|\,|J_i| > \varepsilon^5 |V|^2$. ∎

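The Pythagoras identity $\|A_Q\|^2 - \|A_P\|^2 = \|A_Q - A_P\|^2$ for a refinement $Q$ of $P$ can be checked numerically on a small example. The Python sketch below is an illustration under assumptions made here: vertices are numbered $0, \ldots, n-1$, matrices are lists of lists, and `project` and `sq_norm` are hypothetical helper names.

```python
def project(matrix, partition):
    """Orthogonal projection M_P: replace each block I x J by its average."""
    n = len(matrix)
    out = [[0.0] * n for _ in range(n)]
    for I in partition:
        for J in partition:
            avg = sum(matrix[u][v] for u in I for v in J) / (len(I) * len(J))
            for u in I:
                for v in J:
                    out[u][v] = avg
    return out

def sq_norm(matrix):
    """Squared Frobenius norm: sum of squared entries."""
    return sum(x * x for row in matrix for x in row)

# Adjacency matrix of the 4-cycle 0-1-2-3-0.
A = [[0, 1, 0, 1],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [1, 0, 1, 0]]
P = [[0, 1, 2, 3]]           # trivial partition
Q = [[0, 2], [1, 3]]         # refinement of P into the two colour classes

A_P, A_Q = project(A, P), project(A, Q)
diff = [[q - p for p, q in zip(row_p, row_q)] for row_p, row_q in zip(A_P, A_Q)]
print(sq_norm(A_Q) - sq_norm(A_P), sq_norm(diff))   # 4.0 4.0
```

Here $A_P$ is constant $1/2$ and $A_Q = A$, so both sides equal $8 - 4 = 4$.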
Define $f_\varepsilon(x) := (1 + \varepsilon^{-1}) x 4^x$ for $\varepsilon, x > 0$. For $n \in \mathbb{N}$, $f_\varepsilon^n$ denotes the $n$-th iterate of $f_\varepsilon$.

Szemerédi's regularity lemma. For each $\varepsilon > 0$ and graph $G = (V, E)$, each partition $P$ of
$V$ has an $\varepsilon$-balanced $\varepsilon$-regular refinement of size $\le f_\varepsilon^{\lfloor \varepsilon^{-5} \rfloor}((1 + \varepsilon^{-1})|P|)$.

Proof. Let $A$ be the adjacency matrix of $G$. Starting with $P$, iteratively apply Lemmas 1
and 2 alternatingly. At each application of Lemma 1, $\|A_P\|^2$ does not decrease, and at each
application of Lemma 2, $\|A_P\|^2$ increases by more than $\varepsilon^5 |V|^2$. Now, for any partition $Q$ of
$V$, $\|A_Q\|^2 \le \|A\|^2 \le |V|^2$. Hence, after at most $\lfloor \varepsilon^{-5} \rfloor$ iterations we must have an $\varepsilon$-balanced
$\varepsilon$-regular partition as required. ∎
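
To get a feeling for how fast the bound grows, the first iterates of $f_\varepsilon$ can be evaluated exactly with Python integers. This is a sketch under assumptions made here: $1/\varepsilon$ is taken to be an integer `inv_eps` so that all values stay integral, and the function name `f` is illustrative.

```python
def f(x, inv_eps):
    """f_eps(x) = (1 + 1/eps) * x * 4**x, with 1/eps given as the integer inv_eps."""
    return (1 + inv_eps) * x * 4 ** x

inv_eps = 2                     # eps = 1/2
x0 = (1 + inv_eps) * 1          # (1 + 1/eps) * |P| for the trivial partition, |P| = 1
x1 = f(x0, inv_eps)             # first iterate
x2 = f(x1, inv_eps)             # second iterate
print(x0, x1)                   # 3 576
print(len(str(x2)))             # number of decimal digits of the second iterate
```

For $\varepsilon = 1/2$ the lemma allows $\lfloor \varepsilon^{-5} \rfloor = 32$ iterations; the second iterate already has several hundred digits, and a third is far beyond direct evaluation.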

We note that if $P$ is an $\varepsilon$-balanced $\varepsilon$-regular partition of $V$, and $C \subseteq P$ is such that all
sets in $C$ have the same size and such that $|V \setminus \bigcup C| \le \varepsilon |V|$, then the number $s$ of $\varepsilon$-irregular
pairs in $C^2$ is at most $\varepsilon (1 - \varepsilon)^{-2} |C|^2$. For let $t$ be the common size of the sets in $C$. Then, by
(2), $s t^2 \le \varepsilon |V|^2 \le \varepsilon (1 - \varepsilon)^{-2} |\bigcup C|^2 = \varepsilon (1 - \varepsilon)^{-2} (t |C|)^2 = \varepsilon (1 - \varepsilon)^{-2} |C|^2 t^2$.

2 For any collection $C$ of subsets of a finite set $S$, there is a partition $R$ of $S$ such that any set in $C$ is a union
of classes of $R$ and such that $|R| \le 2^{|C|}$: take $R := \{ \bigcap_{X \in D} X \cap \bigcap_{Y \in C \setminus D} (S \setminus Y) \mid D \subseteq C \} \setminus \{\emptyset\}$.
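
The partition $R$ of footnote 2 can be computed by grouping the elements of $S$ by which sets of the collection contain them; each nonempty group is one of the intersections in the formula. The Python sketch below is an illustration only, with the hypothetical name `common_refinement` and the collection passed as a list of sets.

```python
def common_refinement(S, collection):
    """Partition S so that every set in `collection` is a union of classes.

    Elements are grouped by their membership pattern: two elements share a
    class exactly when they lie in the same sets of the collection.
    """
    classes = {}
    for x in S:
        signature = tuple(x in X for X in collection)
        classes.setdefault(signature, set()).add(x)
    return list(classes.values())

R = common_refinement(range(6), [{0, 1, 2}, {2, 3}])
print(len(R))   # 4
```

With two sets in the collection there are at most $2^2 = 4$ membership patterns, and in this example all four occur: $\{0, 1\}$, $\{2\}$, $\{3\}$ and $\{4, 5\}$.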






